Overview

Dataset statistics

Number of variables17
Number of observations12227
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 MiB
Average record size in memory136.0 B

Variable types

Numeric13
Boolean1
Categorical3

Warnings

release_date has a high cardinality: 3859 distinct values High cardinality
id is uniformly distributed Uniform
id has unique values Unique
instrumentalness has 3602 (29.5%) zeros Zeros
key has 1481 (12.1%) zeros Zeros

Reproduction

Analysis started2021-03-10 11:40:35.016994
Analysis finished2021-03-10 11:41:06.630829
Duration31.61 seconds
Software versionpandas-profiling v2.11.0
Download configurationconfig.yaml

Variables

id
Real number (ℝ≥0)

UNIFORM
UNIQUE

Distinct12227
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8094.03435
Minimum1
Maximum16227
Zeros0
Zeros (%)0.0%
Memory size95.6 KiB

Quantile statistics

Minimum1
5-th percentile789.3
Q14026
median8093
Q312180
95-th percentile15409.7
Maximum16227
Range16226
Interquartile range (IQR)8154

Descriptive statistics

Standard deviation4690.929822
Coefficient of variation (CV)0.57955398
Kurtosis-1.202594167
Mean8094.03435
Median Absolute Deviation (MAD)4077
Skewness0.001697230668
Sum98965758
Variance22004822.59
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20471
 
< 0.1%
54081
 
< 0.1%
33711
 
< 0.1%
13221
 
< 0.1%
74651
 
< 0.1%
54161
 
< 0.1%
115591
 
< 0.1%
156531
 
< 0.1%
136041
 
< 0.1%
33631
 
< 0.1%
Other values (12217)12217
99.9%
ValueCountFrequency (%)
11
< 0.1%
31
< 0.1%
41
< 0.1%
51
< 0.1%
61
< 0.1%
ValueCountFrequency (%)
162271
< 0.1%
162251
< 0.1%
162241
< 0.1%
162231
< 0.1%
162221
< 0.1%

acousticness
Real number (ℝ≥0)

Distinct2714
Distinct (%)22.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4305783602
Minimum1.04 × 106
Maximum0.996
Zeros0
Zeros (%)0.0%
Memory size95.6 KiB

Quantile statistics

Minimum1.04 × 106
5-th percentile0.00102
Q10.05895
median0.354
Q30.805
95-th percentile0.989
Maximum0.996
Range0.99599896
Interquartile range (IQR)0.74605

Descriptive statistics

Standard deviation0.3668928922
Coefficient of variation (CV)0.8520931987
Kurtosis-1.512941893
Mean0.4305783602
Median Absolute Deviation (MAD)0.3316
Skewness0.2615280658
Sum5264.681611
Variance0.1346103944
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.995156
 
1.3%
0.994119
 
1.0%
0.99382
 
0.7%
0.99171
 
0.6%
0.99268
 
0.6%
0.9949
 
0.4%
0.98945
 
0.4%
0.98643
 
0.4%
0.99641
 
0.3%
0.98438
 
0.3%
Other values (2704)11515
94.2%
ValueCountFrequency (%)
1.04 × 1061
< 0.1%
1.08 × 1061
< 0.1%
1.17 × 1061
< 0.1%
1.2 × 1061
< 0.1%
1.34 × 1061
< 0.1%
ValueCountFrequency (%)
0.99641
 
0.3%
0.995156
1.3%
0.994119
1.0%
0.99382
0.7%
0.99268
0.6%

danceability
Real number (ℝ≥0)

Distinct898
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.556352654
Minimum0
Maximum0.98
Zeros13
Zeros (%)0.1%
Memory size95.6 KiB

Quantile statistics

Minimum0
5-th percentile0.248
Q10.438
median0.569
Q30.685
95-th percentile0.827
Maximum0.98
Range0.98
Interquartile range (IQR)0.247

Descriptive statistics

Standard deviation0.175372545
Coefficient of variation (CV)0.315218313
Kurtosis-0.3514219069
Mean0.556352654
Median Absolute Deviation (MAD)0.123
Skewness-0.2899234856
Sum6802.5239
Variance0.03075552954
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.63240
 
0.3%
0.61138
 
0.3%
0.66537
 
0.3%
0.50136
 
0.3%
0.62136
 
0.3%
0.62336
 
0.3%
0.4935
 
0.3%
0.60634
 
0.3%
0.57634
 
0.3%
0.62834
 
0.3%
Other values (888)11867
97.1%
ValueCountFrequency (%)
013
0.1%
0.06083
 
< 0.1%
0.06121
 
< 0.1%
0.06221
 
< 0.1%
0.06251
 
< 0.1%
ValueCountFrequency (%)
0.982
< 0.1%
0.9781
< 0.1%
0.9741
< 0.1%
0.9711
< 0.1%
0.9681
< 0.1%

energy
Real number (ℝ≥0)

Distinct1396
Distinct (%)11.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5221287124
Minimum2.03 × 105
Maximum1
Zeros0
Zeros (%)0.0%
Memory size95.6 KiB

Quantile statistics

Minimum2.03 × 105
5-th percentile0.09496
Q10.303
median0.534
Q30.739
95-th percentile0.93
Maximum1
Range0.9999797
Interquartile range (IQR)0.436

Descriptive statistics

Standard deviation0.2624822911
Coefficient of variation (CV)0.5027156807
Kurtosis-1.055906763
Mean0.5221287124
Median Absolute Deviation (MAD)0.217
Skewness-0.09056870595
Sum6384.067767
Variance0.06889695314
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.41226
 
0.2%
0.53826
 
0.2%
0.51224
 
0.2%
0.70124
 
0.2%
0.61424
 
0.2%
0.57424
 
0.2%
0.59323
 
0.2%
0.43123
 
0.2%
0.53723
 
0.2%
0.55323
 
0.2%
Other values (1386)11987
98.0%
ValueCountFrequency (%)
2.03 × 1051
< 0.1%
7.46 × 1051
< 0.1%
0.0002611
< 0.1%
0.0002811
< 0.1%
0.001211
< 0.1%
ValueCountFrequency (%)
14
< 0.1%
0.9992
 
< 0.1%
0.9982
 
< 0.1%
0.9977
0.1%
0.9967
0.1%

explicit
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
False
10906 
True
1321 
ValueCountFrequency (%)
False10906
89.2%
True1321
 
10.8%

instrumentalness
Real number (ℝ≥0)

ZEROS

Distinct3658
Distinct (%)29.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1493205587
Minimum0
Maximum1
Zeros3602
Zeros (%)29.5%
Memory size95.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0.000115
Q30.05565
95-th percentile0.895
Maximum1
Range1
Interquartile range (IQR)0.05565

Descriptive statistics

Standard deviation0.2979543138
Coefficient of variation (CV)1.995400476
Kurtosis1.579457133
Mean0.1493205587
Median Absolute Deviation (MAD)0.000115
Skewness1.804604491
Sum1825.742471
Variance0.0887767731
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
03602
29.5%
0.91817
 
0.1%
0.86216
 
0.1%
0.89216
 
0.1%
0.90816
 
0.1%
0.89615
 
0.1%
0.92915
 
0.1%
0.89814
 
0.1%
0.89114
 
0.1%
0.89314
 
0.1%
Other values (3648)8488
69.4%
ValueCountFrequency (%)
03602
29.5%
1 × 1063
 
< 0.1%
1.01 × 1066
 
< 0.1%
1.02 × 1066
 
< 0.1%
1.03 × 1063
 
< 0.1%
ValueCountFrequency (%)
13
< 0.1%
0.9982
< 0.1%
0.9971
 
< 0.1%
0.9951
 
< 0.1%
0.9911
 
< 0.1%

key
Real number (ℝ≥0)

ZEROS

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.205201603
Minimum0
Maximum11
Zeros1481
Zeros (%)12.1%
Memory size95.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12
median5
Q38
95-th percentile11
Maximum11
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.526953879
Coefficient of variation (CV)0.6775825698
Kurtosis-1.274939405
Mean5.205201603
Median Absolute Deviation (MAD)3
Skewness0.01892504914
Sum63644
Variance12.43940366
MonotocityNot monotonic
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
01481
12.1%
71464
12.0%
21336
10.9%
91262
10.3%
51170
9.6%
11037
8.5%
4923
7.5%
10842
6.9%
11825
6.7%
8732
6.0%
Other values (2)1155
9.4%
ValueCountFrequency (%)
01481
12.1%
11037
8.5%
21336
10.9%
3498
 
4.1%
4923
7.5%
ValueCountFrequency (%)
11825
6.7%
10842
6.9%
91262
10.3%
8732
6.0%
71464
12.0%

liveness
Real number (ℝ≥0)

Distinct1477
Distinct (%)12.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.201364562
Minimum0.0147
Maximum0.997
Zeros0
Zeros (%)0.0%
Memory size95.6 KiB

Quantile statistics

Minimum0.0147
5-th percentile0.0578
Q10.0962
median0.132
Q30.252
95-th percentile0.5977
Maximum0.997
Range0.9823
Interquartile range (IQR)0.1558

Descriptive statistics

Standard deviation0.1739874923
Coefficient of variation (CV)0.8640422651
Kurtosis5.314608221
Mean0.201364562
Median Absolute Deviation (MAD)0.051
Skewness2.212065103
Sum2462.0845
Variance0.03027164747
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.111132
 
1.1%
0.109112
 
0.9%
0.11112
 
0.9%
0.103111
 
0.9%
0.106110
 
0.9%
0.105110
 
0.9%
0.113109
 
0.9%
0.102106
 
0.9%
0.107102
 
0.8%
0.112101
 
0.8%
Other values (1467)11122
91.0%
ValueCountFrequency (%)
0.01471
< 0.1%
0.0151
< 0.1%
0.01621
< 0.1%
0.01661
< 0.1%
0.01851
< 0.1%
ValueCountFrequency (%)
0.9971
< 0.1%
0.9941
< 0.1%
0.9931
< 0.1%
0.9922
< 0.1%
0.9911
< 0.1%

loudness
Real number (ℝ)

Distinct8718
Distinct (%)71.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-10.66868651
Minimum-43.738
Maximum1.006
Zeros0
Zeros (%)0.0%
Memory size95.6 KiB

Quantile statistics

Minimum-43.738
5-th percentile-21.118
Q1-13.656
median-9.584
Q3-6.5715
95-th percentile-3.9239
Maximum1.006
Range44.744
Interquartile range (IQR)7.0845

Descriptive statistics

Standard deviation5.506888135
Coefficient of variation (CV)-0.5161730198
Kurtosis2.16120605
Mean-10.66868651
Median Absolute Deviation (MAD)3.4
Skewness-1.199524572
Sum-130446.03
Variance30.32581693
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-6.9016
 
< 0.1%
-7.5096
 
< 0.1%
-7.3556
 
< 0.1%
-10.6056
 
< 0.1%
-6.7846
 
< 0.1%
-4.2555
 
< 0.1%
-10.215
 
< 0.1%
-10.1995
 
< 0.1%
-10.3225
 
< 0.1%
-6.0865
 
< 0.1%
Other values (8708)12172
99.6%
ValueCountFrequency (%)
-43.7381
< 0.1%
-43.4691
< 0.1%
-42.0011
< 0.1%
-41.7861
< 0.1%
-41.5941
< 0.1%
ValueCountFrequency (%)
1.0061
< 0.1%
-0.0291
< 0.1%
-0.5741
< 0.1%
-0.7951
< 0.1%
-0.9232
< 0.1%

mode
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size95.6 KiB
Major
8487 
Minor
3740 

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters61135
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMajor
2nd rowMajor
3rd rowMinor
4th rowMajor
5th rowMajor
ValueCountFrequency (%)
Major8487
69.4%
Minor3740
30.6%
Histogram of lengths of the category
ValueCountFrequency (%)
major8487
69.4%
minor3740
30.6%

Most occurring characters

ValueCountFrequency (%)
M12227
20.0%
o12227
20.0%
r12227
20.0%
a8487
13.9%
j8487
13.9%
i3740
 
6.1%
n3740
 
6.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter48908
80.0%
Uppercase Letter12227
 
20.0%

Most frequent character per category

ValueCountFrequency (%)
o12227
25.0%
r12227
25.0%
a8487
17.4%
j8487
17.4%
i3740
 
7.6%
n3740
 
7.6%
ValueCountFrequency (%)
M12227
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin61135
100.0%

Most frequent character per script

ValueCountFrequency (%)
M12227
20.0%
o12227
20.0%
r12227
20.0%
a8487
13.9%
j8487
13.9%
i3740
 
6.1%
n3740
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII61135
100.0%

Most frequent character per block

ValueCountFrequency (%)
M12227
20.0%
o12227
20.0%
r12227
20.0%
a8487
13.9%
j8487
13.9%
i3740
 
6.1%
n3740
 
6.1%

release_date
Categorical

HIGH CARDINALITY

Distinct3859
Distinct (%)31.6%
Missing0
Missing (%)0.0%
Memory size95.6 KiB
01-01-1961
 
90
01-01-1962
 
88
01-01-1992
 
85
01-01-1998
 
84
01-01-1990
 
82
Other values (3854)
11798 

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters122270
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2256 ?
Unique (%)18.5%

Sample

1st row01-01-1947
2nd row13-11-2020
3rd row01-01-1950
4th row30-04-1974
5th row01-01-1973
ValueCountFrequency (%)
01-01-196190
 
0.7%
01-01-196288
 
0.7%
01-01-199285
 
0.7%
01-01-199884
 
0.7%
01-01-199082
 
0.7%
01-01-194582
 
0.7%
01-01-194080
 
0.7%
01-01-195879
 
0.6%
01-01-198778
 
0.6%
01-01-195678
 
0.6%
Other values (3849)11401
93.2%
Histogram of lengths of the category
ValueCountFrequency (%)
01-01-196190
 
0.7%
01-01-196288
 
0.7%
01-01-199285
 
0.7%
01-01-199884
 
0.7%
01-01-199082
 
0.7%
01-01-194582
 
0.7%
01-01-194080
 
0.7%
01-01-195879
 
0.6%
01-01-198778
 
0.6%
01-01-195678
 
0.6%
Other values (3849)11401
93.2%

Most occurring characters

ValueCountFrequency (%)
129094
23.8%
025638
21.0%
-24454
20.0%
912345
10.1%
210056
 
8.2%
83914
 
3.2%
63739
 
3.1%
73702
 
3.0%
53343
 
2.7%
33201
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number97816
80.0%
Dash Punctuation24454
 
20.0%

Most frequent character per category

ValueCountFrequency (%)
129094
29.7%
025638
26.2%
912345
12.6%
210056
 
10.3%
83914
 
4.0%
63739
 
3.8%
73702
 
3.8%
53343
 
3.4%
33201
 
3.3%
42784
 
2.8%
ValueCountFrequency (%)
-24454
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common122270
100.0%

Most frequent character per script

ValueCountFrequency (%)
129094
23.8%
025638
21.0%
-24454
20.0%
912345
10.1%
210056
 
8.2%
83914
 
3.2%
63739
 
3.1%
73702
 
3.0%
53343
 
2.7%
33201
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII122270
100.0%

Most frequent character per block

ValueCountFrequency (%)
129094
23.8%
025638
21.0%
-24454
20.0%
912345
10.1%
210056
 
8.2%
83914
 
3.2%
63739
 
3.1%
73702
 
3.0%
53343
 
2.7%
33201
 
2.6%

speechiness
Real number (ℝ≥0)

Distinct1275
Distinct (%)10.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.09767980698
Minimum0
Maximum0.968
Zeros13
Zeros (%)0.1%
Memory size95.6 KiB

Quantile statistics

Minimum0
5-th percentile0.0279
Q10.0347
median0.0456
Q30.0789
95-th percentile0.349
Maximum0.968
Range0.968
Interquartile range (IQR)0.0442

Descriptive statistics

Standard deviation0.155894608
Coefficient of variation (CV)1.595975799
Kurtosis18.11328828
Mean0.09767980698
Median Absolute Deviation (MAD)0.0139
Skewness4.10021255
Sum1194.331
Variance0.02430312881
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.032151
 
0.4%
0.031949
 
0.4%
0.034348
 
0.4%
0.036347
 
0.4%
0.028747
 
0.4%
0.030546
 
0.4%
0.032446
 
0.4%
0.033746
 
0.4%
0.033946
 
0.4%
0.033345
 
0.4%
Other values (1265)11756
96.1%
ValueCountFrequency (%)
013
0.1%
0.02231
 
< 0.1%
0.02261
 
< 0.1%
0.02282
 
< 0.1%
0.0231
 
< 0.1%
ValueCountFrequency (%)
0.9681
 
< 0.1%
0.9672
 
< 0.1%
0.9656
< 0.1%
0.9645
< 0.1%
0.96311
0.1%

tempo
Real number (ℝ≥0)

Distinct11264
Distinct (%)92.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean118.1674949
Minimum0
Maximum216.843
Zeros13
Zeros (%)0.1%
Memory size95.6 KiB

Quantile statistics

Minimum0
5-th percentile75.3244
Q195.0505
median116.915
Q3136.1085
95-th percentile174.5798
Maximum216.843
Range216.843
Interquartile range (IQR)41.058

Descriptive statistics

Standard deviation30.20006382
Coefficient of variation (CV)0.2555699759
Kurtosis-0.01355677422
Mean118.1674949
Median Absolute Deviation (MAD)20.875
Skewness0.4147768412
Sum1444833.96
Variance912.0438545
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
013
 
0.1%
128.018
 
0.1%
125.0055
 
< 0.1%
120.0055
 
< 0.1%
106.9914
 
< 0.1%
1304
 
< 0.1%
128.0074
 
< 0.1%
123.9974
 
< 0.1%
130.0294
 
< 0.1%
119.9934
 
< 0.1%
Other values (11254)12172
99.6%
ValueCountFrequency (%)
013
0.1%
39.8752
 
< 0.1%
42.491
 
< 0.1%
44.91
 
< 0.1%
46.3291
 
< 0.1%
ValueCountFrequency (%)
216.8431
< 0.1%
216.0961
< 0.1%
216.0831
< 0.1%
215.0231
< 0.1%
212.2421
< 0.1%

valence
Real number (ℝ≥0)

Distinct1256
Distinct (%)10.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5253000728
Minimum0
Maximum1
Zeros17
Zeros (%)0.1%
Memory size95.6 KiB

Quantile statistics

Minimum0
5-th percentile0.09893
Q10.321
median0.532
Q30.737
95-th percentile0.933
Maximum1
Range1
Interquartile range (IQR)0.416

Descriptive statistics

Standard deviation0.2582046985
Coefficient of variation (CV)0.4915375265
Kurtosis-1.023706984
Mean0.5253000728
Median Absolute Deviation (MAD)0.208
Skewness-0.08203179183
Sum6422.84399
Variance0.06666966631
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.96141
 
0.3%
0.96434
 
0.3%
0.96234
 
0.3%
0.96733
 
0.3%
0.96532
 
0.3%
0.96628
 
0.2%
0.5426
 
0.2%
0.96325
 
0.2%
0.35724
 
0.2%
0.9624
 
0.2%
Other values (1246)11926
97.5%
ValueCountFrequency (%)
017
0.1%
1 × 1057
0.1%
0.005541
 
< 0.1%
0.005581
 
< 0.1%
0.01541
 
< 0.1%
ValueCountFrequency (%)
11
< 0.1%
0.9931
< 0.1%
0.991
< 0.1%
0.9891
< 0.1%
0.9881
< 0.1%

year
Real number (ℝ≥0)

Distinct102
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1984.517298
Minimum1920
Maximum2021
Zeros0
Zeros (%)0.0%
Memory size95.6 KiB

Quantile statistics

Minimum1920
5-th percentile1937
Q11966
median1987
Q32008
95-th percentile2019
Maximum2021
Range101
Interquartile range (IQR)42

Descriptive statistics

Standard deviation25.91199777
Coefficient of variation (CV)0.01305707831
Kurtosis-0.8089325033
Mean1984.517298
Median Absolute Deviation (MAD)21
Skewness-0.4107160383
Sum24264693
Variance671.4316286
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2020466
 
3.8%
2018311
 
2.5%
2019288
 
2.4%
2017265
 
2.2%
2016258
 
2.1%
2015211
 
1.7%
2013199
 
1.6%
2002197
 
1.6%
1999187
 
1.5%
1998185
 
1.5%
Other values (92)9660
79.0%
ValueCountFrequency (%)
192013
0.1%
19214
 
< 0.1%
19225
 
< 0.1%
19235
 
< 0.1%
19249
0.1%
ValueCountFrequency (%)
2021111
 
0.9%
2020466
3.8%
2019288
2.4%
2018311
2.5%
2017265
2.2%

duration-min
Real number (ℝ≥0)

Distinct172
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.888132821
Minimum0.2
Maximum72.8
Zeros0
Zeros (%)0.0%
Memory size95.6 KiB

Quantile statistics

Minimum0.2
5-th percentile1.9
Q12.9
median3.6
Q34.4
95-th percentile6.7
Maximum72.8
Range72.6
Interquartile range (IQR)1.5

Descriptive statistics

Standard deviation2.383133109
Coefficient of variation (CV)0.6129248199
Kurtosis283.0939223
Mean3.888132821
Median Absolute Deviation (MAD)0.7
Skewness12.46312956
Sum47540.2
Variance5.679323415
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.1489
 
4.0%
3.2487
 
4.0%
3.4483
 
4.0%
3458
 
3.7%
3.3457
 
3.7%
3.6457
 
3.7%
3.5449
 
3.7%
3.7434
 
3.5%
2.9419
 
3.4%
3.9404
 
3.3%
Other values (162)7690
62.9%
ValueCountFrequency (%)
0.23
 
< 0.1%
0.36
 
< 0.1%
0.45
 
< 0.1%
0.511
0.1%
0.623
0.2%
ValueCountFrequency (%)
72.82
< 0.1%
66.91
< 0.1%
62.21
< 0.1%
60.31
< 0.1%
59.31
< 0.1%

popularity
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size95.6 KiB
very low
3222 
low
3118 
average
2912 
high
2606 
very high
369 

Length

Max length9
Median length7
Mean length5.664431177
Min length3

Characters and Unicode

Total characters69259
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowvery low
2nd rowlow
3rd rowvery low
4th rowlow
5th rowaverage
ValueCountFrequency (%)
very low3222
26.4%
low3118
25.5%
average2912
23.8%
high2606
21.3%
very high369
 
3.0%
Histogram of lengths of the category
ValueCountFrequency (%)
low6340
40.1%
very3591
22.7%
high2975
18.8%
average2912
18.4%

Most occurring characters

ValueCountFrequency (%)
e9415
13.6%
v6503
9.4%
r6503
9.4%
l6340
9.2%
o6340
9.2%
w6340
9.2%
h5950
8.6%
g5887
8.5%
a5824
8.4%
y3591
 
5.2%
Other values (2)6566
9.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter65668
94.8%
Space Separator3591
 
5.2%

Most frequent character per category

ValueCountFrequency (%)
e9415
14.3%
v6503
9.9%
r6503
9.9%
l6340
9.7%
o6340
9.7%
w6340
9.7%
h5950
9.1%
g5887
9.0%
a5824
8.9%
y3591
 
5.5%
ValueCountFrequency (%)
3591
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin65668
94.8%
Common3591
 
5.2%

Most frequent character per script

ValueCountFrequency (%)
e9415
14.3%
v6503
9.9%
r6503
9.9%
l6340
9.7%
o6340
9.7%
w6340
9.7%
h5950
9.1%
g5887
9.0%
a5824
8.9%
y3591
 
5.5%
ValueCountFrequency (%)
3591
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII69259
100.0%

Most frequent character per block

ValueCountFrequency (%)
e9415
13.6%
v6503
9.4%
r6503
9.4%
l6340
9.2%
o6340
9.2%
w6340
9.2%
h5950
8.6%
g5887
8.5%
a5824
8.4%
y3591
 
5.2%
Other values (2)6566
9.5%

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

idacousticnessdanceabilityenergyexplicitinstrumentalnesskeylivenessloudnessmoderelease_datespeechinesstempovalenceyearduration-minpopularity
020150.9490.2350.0276No0.9270050.5130-27.398Major01-01-19470.0381110.8380.039819473.0very low
1159010.8550.4560.4850No0.0884040.1510-10.046Major13-11-20200.0437152.0660.859020202.4low
290020.8270.4950.4990No0.0000000.4010-8.009Minor01-01-19500.0474108.0040.709019502.6very low
367340.6540.6430.4690No0.1080070.2180-15.917Major30-04-19740.036883.6360.964019742.4low
4155630.7380.7050.3110No0.0000050.3220-12.344Major01-01-19730.0488117.2600.785019733.4average
5143840.8980.4980.4420No0.00319100.0974-9.481Major01-01-19680.0337109.6190.355019682.6low
69540.2590.6200.7580No0.0013250.4160-8.183Major13-11-19420.0343119.2580.912019422.4very low
759300.1240.8790.6280Yes0.0000010.0661-6.668Minor01-01-20050.2640150.1050.721020053.5average
8119000.1490.6970.1840Yes0.0000020.0763-23.303Minor01-01-19450.9330133.9970.613019451.6very low
9144980.4700.5870.5660No0.0000090.0644-9.932Major01-01-19990.027676.0540.529019997.7high

Last rows

idacousticnessdanceabilityenergyexplicitinstrumentalnesskeylivenessloudnessmoderelease_datespeechinesstempovalenceyearduration-minpopularity
1221725210.95200.11100.139No0.84700070.173-25.052Minor01-01-19950.0465176.4470.0835019951.2average
1221882800.94500.49200.122No0.868000110.108-19.844Minor26-11-19710.0702131.6460.5990019713.0low
12219132050.13700.40800.922No0.44700070.983-9.745Major14-11-19790.0526110.4070.3420019793.4low
12220108640.26300.65300.609No0.001010110.233-7.519Minor26-06-20010.037095.9820.4820020013.5high
12221152340.90900.43500.433No0.96300020.118-20.343Minor25-01-20110.0348179.9230.2600020111.6average
12222153430.04080.80900.801No0.00000010.353-5.461Major01-07-20140.407081.9400.7440020143.4average
1222317010.91200.45100.240No0.00000210.175-14.014Major01-01-19590.0351134.0090.7010019592.0very high
1222433510.32800.55100.564No0.00295020.352-9.298Minor01-01-19840.0338124.8830.8900019842.5low
1222588790.12200.06080.939No0.99100010.912-26.324Major09-01-20170.118073.2340.0055820173.1high
1222697110.03800.38900.768Yes0.00000010.119-4.765Major24-07-20200.256090.1460.3340020203.1high